AITopics | Mecca

Collaborating Authors

Mecca

SmartRAG: Jointly Learn RAG-Related Tasks From the Environment Feedback

Gao, Jingsheng, Li, Linxu, Li, Weiyuan, Fu, Yuzhuo, Dai, Bin

arXiv.org Artificial IntelligenceOct-22-2024

RAG systems consist of multiple modules to work together. However, these modules are usually separately trained. We argue that a system like RAG that incorporates multiple modules should be jointly optimized to achieve optimal performance. To demonstrate this, we design a specific pipeline called SmartRAG that includes a policy network and a retriever. The policy network can serve as 1) a decision maker that decides when to retrieve, 2) a query rewriter to generate a query most suited to the retriever, and 3) an answer generator that produces the final response with/without the observations. We then propose to jointly optimize the whole system using a reinforcement learning algorithm, with the reward designed to encourage the system to achieve the best performance with minimal retrieval cost. When jointly optimized, all the modules can be aware of how other modules are working and thus find the best way to work together as a complete system. Empirical results demonstrate that the jointly optimized SmartRAG can achieve better performance than separately optimized counterparts. Although large language models(LLMs) (Chowdhery et al., 2023; Touvron et al., 2023; Chung et al., 2024) have demonstrated exceptional capabilities across various domains, addressing knowledgerelated issues beyond model parameters remains a challenging task (Mallen et al., 2023b; Min et al., 2023). Retrieval-augmentation generation(RAG) effectively enhances model performance in these scenarios by retrieving additional information from external tools (Ram et al., 2023). RAG systems usually consist of multiple modules including at least a retriever and a generator. Some systems may have other modules like a reranker (Glass et al., 2022), a decision maker deciding when to retrieve (Jeong et al., 2024; Wang et al., 2023a), a query rewriter (Ma et al., 2023; Tan et al., 2024) or a verifier (Lewis et al., 2020; Izacard et al., 2023). These modules are often hand-designed and separately optimized. One of the issues is that the golden answer of the intermediate modules are usually not accessible. What is worse, sometimes the golden answer is model-dependent or retriever-dependent. For example, Asai et al. (2024) uses the result of GPT4 (Achiam et al., 2023) as the ground truth for the decision maker, which can be suboptimal.

large language model, machine learning, natural language, (17 more...)

arXiv.org Artificial Intelligence

2410.18141

Country:

Europe > Ireland (0.04)
South America > Colombia (0.04)
South America > Brazil (0.04)
(25 more...)

Genre: Research Report > New Finding (0.66)

Industry:

Media > Film (1.00)
Government (1.00)
Leisure & Entertainment > Sports > Basketball (0.67)
Banking & Finance (0.67)

Technology:

Information Technology > Information Management > Search (1.00)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.49)

Add feedback

Movement Control of Smart Mosque's Domes using CSRNet and Fuzzy Logic Techniques

Blasi, Anas H., Lababede, Mohammad Awis Al, Alsuwaiket, Mohammed A.

arXiv.org Artificial IntelligenceOct-13-2024

Mosques are worship places of Allah and must be preserved clean, immaculate, provide all the comforts of the worshippers in them. The prophet's mosque in Medina/ Saudi Arabia is one of the most important mosques for Muslims. It occupies second place after the sacred mosque in Mecca/ Saudi Arabia, which is in constant overcrowding by all Muslims to visit the prophet Mohammad's tomb. This paper aims to propose a smart dome model to preserve the fresh air and allow the sunlight to enter the mosque using artificial intelligence techniques. The proposed model controls domes movements based on the weather conditions and the overcrowding rates in the mosque. The data have been collected from two different resources, the first one from the database of Saudi Arabia weather's history, and the other from Shanghai Technology Database. Congested Scene Recognition Network (CSRNet) and Fuzzy techniques have applied using Python programming language to control the domes to be opened and closed for a specific time to renew the air inside the mosque. Also, this model consists of several parts that are connected for controlling the mechanism of opening/closing domes according to weather data and the situation of crowding in the mosque. Finally, the main goal of this paper has been achieved, and the proposed model has worked efficiently and specifies the exact duration time to keep the domes open automatically for a few minutes for each hour head.

machine learning, mosque, programming language, (15 more...)

arXiv.org Artificial Intelligence

doi: 10.14569/issn.2156-5570

2410.18123

Country:

Asia > China > Shanghai > Shanghai (0.25)
Asia > Middle East > Saudi Arabia > Medina Province > Medina (0.24)
Asia > Middle East > Saudi Arabia > Mecca Province > Mecca (0.24)
(5 more...)

Genre: Research Report > New Finding (0.46)

Industry:

Health & Medicine (0.69)
Education > Educational Setting (0.68)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Uncertainty > Fuzzy Logic (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)
Information Technology > Software > Programming Languages (0.87)

Add feedback

Forcing Diffuse Distributions out of Language Models

Zhang, Yiming, Schwarzschild, Avi, Carlini, Nicholas, Kolter, Zico, Ippolito, Daphne

arXiv.org Artificial IntelligenceApr-16-2024

Despite being trained specifically to follow user instructions, today's language models perform poorly when instructed to produce random outputs. For example, when prompted to pick a number uniformly between one and ten Llama-2-13B-chat disproportionately favors the number five, and when tasked with picking a first name at random, Mistral-7B-Instruct chooses Avery 40 times more often than we would expect based on the U.S. population. When these language models are used for real-world tasks where diversity of outputs is crucial, such as language model assisted dataset construction, their inability to produce diffuse distributions over valid choices is a major hurdle. In this work, we propose a fine-tuning method that encourages language models to output distributions that are diffuse over valid outcomes. The methods we introduce generalize across a variety of tasks and distributions and make large language models practical for synthetic dataset generation with little human intervention.

biography, diversity, language model, (14 more...)

arXiv.org Artificial Intelligence

2404.10859

Country:

Asia > Japan > Honshū > Kantō > Tokyo Metropolis Prefecture > Tokyo (0.14)
Europe > France > Île-de-France > Paris > Paris (0.04)
Asia > South Korea > Seoul > Seoul (0.04)
(22 more...)

Genre: Research Report (1.00)

Industry:

Government (0.68)
Energy (0.68)
Media (0.46)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.52)

Add feedback

Can Vision-Language Models be a Good Guesser? Exploring VLMs for Times and Location Reasoning

Zhang, Gengyuan, Zhang, Yurui, Zhang, Kerui, Tresp, Volker

arXiv.org Artificial IntelligenceDec-29-2023

Vision-Language Models (VLMs) are expected to be capable of reasoning with commonsense knowledge as human beings. One example is that humans can reason where and when an image is taken based on their knowledge. This makes us wonder if, based on visual cues, Vision-Language Models that are pre-trained with large-scale image-text resources can achieve and even outperform human's capability in reasoning times and location. To address this question, we propose a two-stage \recognition\space and \reasoning\space probing task, applied to discriminative and generative VLMs to uncover whether VLMs can recognize times and location-relevant features and further reason about it. To facilitate the investigation, we introduce WikiTiLo, a well-curated image dataset compromising images with rich socio-cultural cues. In the extensive experimental studies, we find that although VLMs can effectively retain relevant features in visual encoders, they still fail to make perfect reasoning. We will release our dataset and codes to facilitate future studies.

generative vlm, reasoning, vlm, (15 more...)

arXiv.org Artificial Intelligence

2307.06166

Country:

Europe > Germany > Bavaria > Upper Bavaria > Munich (0.05)
Asia > China (0.04)
Europe > Netherlands > North Holland > Amsterdam (0.04)
(18 more...)

Genre:

Research Report > New Finding (0.48)
Research Report > Experimental Study (0.34)

Technology:

Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.70)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.68)

Add feedback

Muneer Mujahed Lyati: Muneer M. Lyati

#artificialintelligenceOct-5-2021, 01:10:30 GMT

Muneer Lyati is an engineer and mechanic from Saudi Arabia. Muneer Lyati was born in Mecca, Saudi Arabia, on November 16, 1982. He received a bachelor's degree in engines and vehicles from Jeddah College of Technology. Muneer Lyati strives to be a trustworthy engineer who delivers professional results to all of his customers. He became one of Saudi Arabia's most sought-after engine and vehicle specialists thanks to his extensive mechanical engineering background and strong management and communication skills.

lyati, muneer lyati, muneer mujahed lyati, (1 more...)

#artificialintelligence

Country:

Asia > Middle East > Saudi Arabia > Mecca Province > Mecca (0.34)
Asia > Middle East > Saudi Arabia > Mecca Province > Jeddah (0.34)

Industry: Automobiles & Trucks (0.40)

Technology: Information Technology > Artificial Intelligence (0.40)

Add feedback

Modeling indoor-level non-pharmaceutical interventions during the COVID-19 pandemic: a pedestrian dynamics-based microscopic simulation approach

Xiao, Yao, Yang, Mofeng, Zhu, Zheng, Yang, Hai, Zhang, Lei, Ghader, Sepehr

arXiv.org Artificial IntelligenceJun-18-2020

Mathematical modeling of epidemic spreading has been widely adopted to estimate the threats of epidemic diseases (i.e., the COVID-19 pandemic) as well as to evaluate epidemic control interventions. The indoor place is considered to be a significant epidemic spreading risk origin, but existing widely-used epidemic spreading models are usually limited for indoor places since the dynamic physical distance changes between people are ignored, and the empirical features of the essential and non-essential travel are not differentiated. In this paper, we introduce a pedestrian-based epidemic spreading model that is capable of modeling indoor transmission risks of diseases during people's social activities. Taking advantage of the before-and-after mobility data from the University of Maryland COVID-19 Impact Analysis Platform, it's found that people tend to spend more time in grocery stores once their travel frequencies are restricted to a low level. In other words, an increase in dwell time could balance the decrease in travel frequencies and satisfy people's demand. Based on the pedestrian-based model and the empirical evidence, combined non-pharmaceutical interventions from different operational levels are evaluated. Numerical simulations show that restrictions on people's travel frequency and open-hours of indoor places may not be universally effective in reducing average infection risks for each pedestrian who visit the place. Entry limitations can be a widely effective alternative, whereas the decision-maker needs to balance the decrease in risky contacts and the increase in queue length outside the place that may impede people from fulfilling their travel needs.

artificial intelligence, epidemic, intervention, (16 more...)

arXiv.org Artificial Intelligence

doi: 10.1016/j.tranpol.2021.05.004

2006.10666

Country:

Asia > China > Hong Kong (0.04)
North America > United States > Maryland > Prince George's County > College Park (0.04)
North America > United States > California (0.04)
(6 more...)

Genre: Research Report > Experimental Study (0.62)

Industry:

Health & Medicine > Therapeutic Area > Infections and Infectious Diseases (1.00)
Health & Medicine > Therapeutic Area > Immunology (1.00)
Health & Medicine > Epidemiology (1.00)
Government > Regional Government > North America Government > United States Government (1.00)

Technology:

Information Technology > Communications (0.68)
Information Technology > Artificial Intelligence > Representation & Reasoning (0.46)

Add feedback

Artificial Intelligence Conferences 2020 in Worldwide

#artificialintelligenceJan-9-2020, 12:34:00 GMT

emerging trend, international conference, science and technology, (17 more...)

#artificialintelligence

Country:

Asia > Middle East > Syria > Damascus Governorate > Damascus (0.07)
Asia > Middle East > Bahrain > Capital Governorate > Manama (0.06)
Oceania > New Zealand > South Island > Canterbury Region > Christchurch (0.06)
(56 more...)

Technology: Information Technology > Artificial Intelligence (0.86)

Add feedback

New facial recognition technology caught 'imposter' using someone else's passport, US officials say

The Independent - TechAug-24-2018, 21:58:53 GMT

A new facial recognition technology caught a man trying to enter the US using a passport belonging to someone else, US officials say. Officials with the US Customs and Border Protection (CBP) and the Office of Field Operations (OFO) intercepted a 26-year-old man, the agencies referred to as an "imposter", who reportedly attempted to use a French passport belonging to someone else, at Washington's Dulles International Airport. The man was travelling to the US from Brazil. "The officer utilised CBP's new facial comparison biometric technology which confirmed the man was not a match to the passport he presented," the CBP press release read. It added: "A search revealed the man's authentic Republic of Congo identification card concealed in his shoe."

artificial intelligence, province, soccer coach, (15 more...)

The Independent - Tech

Country:

Asia > South Korea (0.48)
Asia > North Korea (0.28)
South America > Brazil (0.24)
(42 more...)

Genre: Press Release (0.34)

Industry:

Transportation > Air (1.00)
Government > Regional Government > North America Government > United States Government (1.00)

Technology: Information Technology > Artificial Intelligence > Vision > Face Recognition (0.61)

Add feedback